PCX world immjBcmwL raj^m organization
INTERNAHONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)
(51) International Patent Cli
odflcatfon 6 ;
(11) International Pul
Mkatkw Number:
WO 96/36941
G06T3/60
Al
(43) International Pul
>Bcatkm Datet 21 No
vernber 1996 (21.11.96)
PCT/US96/0H18
sry 1996 (29.01.96)
15 Mty 1995 (15j05.95)
(72) Inventors: WOBER, Munib, A.; 6 Cliffe Avenue, Havernffl,
MA 01832 (US), HAJJ AHMAD, Ibrahim; Apartment #12,
262 School Street, Sommervllle. MA 02145 (US). RHSCH,
Michael. 53 Nathan Lane, Carlisle. MA 01741 (US).
SUNSHINE, Lot, B.; Apartment #6. 80 Revere Street.
Boston, MA 02114 (US).
(74) Agent: SABOURIN. Robert. A.; Polaroid Corporation. Patent
Dept. 3rd floor. 575 Technology Square, Cambridge, MA
02139-3589 (US).
(SI) Designated States: CA, JP, SO, European patent (AT, BE, CH,
DB, DK, ES. PR, OB. OR, IE, IT, LU, MC NL, PT, SB).
(54) Title! IMAGE ROTATION USING DISCRETE COSINE TRANSFORMS
An linage processing system for skewing or rotating an Image includes an Image
acquisition device, first, second and third memories, a matrix multiplier, control sequencer lock
and an output device. Image rotation is facilitated by rjerfonnmg a series *
on image data points representing the image In a spatial domain. A Bret shearing operation
produces » first sheared image by shearing the image. A second shearing operation produces s
second sheared image by shearing the first sheared image. A third trresrtag operation prc<hice«
a third sheared image corresponding to a routed image by shearing tbe second sheared irnsge.
For each shearing operation the image data points am transformed into DCT coefficients In
data points are generated by talcing a modified inverse discrete even cosine transform (IDCT) of
the DCT coefficients using a modified IDCT basis function frprndm upon either a horizontal
or vertical offset.
1 «-T*h|
000052785
F OR T HE P URPOS ES O QNfLQ YRMATI 0
WO 96/36941 PCT/US96/01418
Inaga rotation using dlscrtto cosine transforms.
This application is a continuation-in-part of U.S. Patent Application No. 08/159,795
filed 30 November 1993 by Munib A. Wooer and Michael L. Reisch. Furthermore, mis
application is related to concurrently filed and commonly assigned U.S. Patent
Applications, assignee nos. C-7940, C-7966, C-8004, C-8005, C-8009 and C-8036.
BACKGROUND OF THR INVTOMTTnM
1. Field of the Invention
The invention relates generally to an improved image processing system and
methods for use with this system. More particularly, the invention relates to a system and
methods thereto for producing a sheared or rotated image.
2. Description of the Prior Art
Images can be thought of as two-dimensional representations of some visual reality
that is distributed in space and/or time. Ordinarily, images are what the human visual
system perceives as variations in external stimuli such as brightness, color, and sometimes
; depth cues. While over the years many techniques have been developed to capture and
reproduce images, their representation as continuous, discrete, or digital signals which can
be manipulated, processed or displayed through the use of computers or other special
purpose electronic hardware is the most recent technique. Now well-established, this latest
technique has a variety of beneficial applications. For instance, while in electronic form,
images cart be enhanced to create special visual effects, restored, coded for transmission to
distant locations, stored in memory (such as on CDROM, DAT, floppy disks, etc.),
reconstructed, displayed, or converted to some other tangible form.
Image processing can occur in various domains such as the spatial domain or the
frequency domain. An image is said to reside in the spatial domain when the values of
the parameters used to describe it, such as luminance, have a direct correspondence with
spatial location. In the frequency domain, the image of the spatial domain may be
represented by a series of frequency components in the form of trigonometric functions
which, when summed for each image data point (Lc, pixel) of the spatial domain, yields
5 the value of the parameter used to characterize the image at mat particular image data
point in the spatial domain. Such a representation may be extended to cover all image
data points of an image.
In the spatial domain, original image data may be represented as a continuous
function of spatial position, designated s 0 (y,x) for the two-dimensional case. For most
10 applications it is acceptable, as well as advantageous, to sample this continuous-space
image along the horizontal and vertical directions at x-iT„ and y-jT v where i and j
are integer indices and T k and T, are the horizontal and vertical sampling periods,
respectively. This yields a matrix of points, aJ$TJR*> which shall be identified henceforth
with the discrete signal designated as s(j,i) for the two-dimensional case where the lower
15 case, s, designates the spatial domain, i is the index of rows, j is the index of columns,
and i and j can be initialized to start at zero. In the frequency domain, matrices can
also be used to mathematically describe an image as a set of transform coefficients (also
referred to as frequency coefficients) which represent frequency data in a transform matrix
conventionally designated, S(v,u), where die upper case, S, designates the frequency
20 domain, u is the index of rows and v is the index of columns.
Spatial image data points may be transformed to the frequency domain using
transformations such as Fourier transforms or discrete cosine transforms. The use of
discrete cosine transforms and inverse discrete cosine transforms for image compression is
well known in the art and, in fact, the practice has been adopted as standard in industry by
25 The Joint Photographic Experts Group (JPEG) and the Motion Picture Expert Group
(MPEG); which were created as part of a joint effort of the Consultative Committee on
International Telegraphy and Telephony (CCITT) and The International Standards
Organization (ISO).
When a discrete even cosine ttansformation (hereinafter DCT) is used, the
30 frequency domain is referred to as the DCT domain and the frequency coefficients are
referred to as DCT coefficients. Conventionally, transforming data from the spatial
domain to the frequency domain is referred to as a forward transformation, whereas
1
WO 96/36941 PCT/US96/01418
transforming data from the frequency domain to the spatial domain is referred to as an
inverse transformation. Hence, a forward discrete cosine transformation is defined as a
transform that maps an image from the original image data points sQ,i) in the spatial
domain to DCT coefficients S(v,u) in the DCT domain according to the basis function of
the forward DCT, whereas an inverse discrete even cosine transformation (or IDCT) is
defined as a transform that maps the DCT coefficients S(v,u) from the DCT domain to
reconstructed image data points 30,0 in the spatial domain according to the basis function
of the IDCT.
Processing an electronically acquired image in an Image processing system
sometimes includes shearing or rotating the image. When an image such as the one
shown in Figure 2 A is rotated, the result is shown in Figure 2B. Rotation is useful in
various applications such as image alignment, described using bilinear interpolation in
"The Image Processing Handbook" by John C. Russ, 1992 CRC Press Inc., pp. 95-100.
The primary object of the current invention is to provide a system for shearing or
rotating an image which is more efficient than existing systems and which is
complementary to international compression standards such as ISO/IEC 10918-1, Section
A.3.3 set by the International Organization of Standards, Joint Photographic Experts Group
and similar standards recognized by the Motion Picture Experts Group. Other objects of
the invention will, in part, appear hereinafter and, in part, be obvious when the following
detailed description is read in conjunction with the drawings.
SUMMARY OF THE INVENTION
An image processing system for skewing (i.e. shearing) or rotating an image
includes: an image acquisition device for acquiring an image signal representing an image;
a first memory for buffering the image signal; a second memory for storing predetermined
discrete cosine transform coefficients; a matrix multiplier for multiplying the image signal
times the predetermined coefficients to produce an output signal; a third memory for
buffering the output signal; control sequencer logic for controlling operation of the image
processing system; and an output device for providing a skewed or rotated image
corresponding to the output signal.
Image rotation according to the inventive system is facilitated by performing a
series of shearing operations on image data points representing an image in the spatial
WO 96/36941 PCT/US96/01418
domain. A first shearing operation produces a first sheared image by shearing the original
image. A second shearing operation produces a second sheared image by shearing the
first sheared image. A third shearing operation produces a third sheared image
corresponding to a rotated image by shearing the second sheared image. For each
shearing operation the image data points are transformed into DCT coefficients in a
frequency domain using a discrete even cosine transform (DCT). Thereafter, skewed
image data points are generated by taking a modified inverse discrete even cosine
transform (IDCT) of the DCT coefficients using a modified IDCT basis function
it upon either a horizontal or vertical offset
BRIEF DESCRIPTION OF THE DRAWINGS
The aforementioned aspects and outer features of the invention are described in
detail in conjunction with the accompanying drawings in which the same reference
numerals are used throughout for denoting corresponding elements and wherein:
Figure 1 is a preferred embodiment of an electronic imaging system according to
Figure 2A is an image; ...
Figure 2B is the image of Figure 2A rotated 90 degrees counterclockwise;
Figure 3 A is a detailed block diagram of the electronic imaging system of Figure
20 1;
Figure 3B is a logic diagram of the matrix multiplier 358 of Figure 3A;
Figure 3C is a timing diagram of signals used and generated by the system of
Figure 3A;
Figure 4A is a flowchart, according to the invention, for horizontally skewing an
image represented in a two-dimensional spatial plane using the system of Figure 3 A;
Figure 4B is a block diagram of the steps of block 8 of Figure 4A for transforming
segments of image data points, 'whereby each segment includes all the image data points
of a row;
Figure 4C is a block diagram of the steps of block 8 of Figure 4A for transforming
segments of image data points, whereby each segment has fewer image data points than
the total number of image data points in the row,
4
WO 96/36941 PCT/US96/01418
Figure 5A is a flowchart, according to the invention, for vertically skewing an
image represented in a two-dimensional spatial plane using the system of Figure 3A;
Figure 5B is a block diagram of the steps of block 46 of Figure 5A for
transforming segments of image data points, whereby each segment includes all the image
data points of a column; and
Figure 5C is a block diagram of the steps of block 46 of Figure 5 A for
transforming segments of image data points, whereby each segment has fewer image data
points than the total number of image data points in a column.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
The present invention relates to an image processing system and associated image
processing methods for shearing or rotating an image of a scene. Figure 1 illustrates one
exemplary embodiment of such a system. As can be seen, Figure 1 illustrates an
electronic image processing system where an image signal source, such as an electronic
still camera 10 or a scanner 12, provides an electronic image signal which represents an
image of the subject (not shown). A computer 18 receives the electronic signal from the
image signal source and thereafter processes the image signal electronically to provide any
number of known image processing functions such as resizing, sharpening, noise removal,
reflection, edge detection or rotation. The processed image can be transmitted, i.e. output,
to any destination device or destination application such as a diskette 16, an user monitor
20, a printer 14, or a remote monitor 26. Operator interaction with the system is
facilitated by use of a keyboard 22 or a mouse 24. Of course, the components shown in
Figure 1 are merely exemplary rather than all inclusive of the many equivalent devices
known by those skilled in the art For instance, the image signal source could include any
device which acts as an image signal source such as an electronic camera, a scanner, a
camcorder, a charge coupled device, a charge injected device, etc. Also, it is noteworthy
that the processing of the image need not necessarily occur solely within the computer 18.
Indeed, various phases or aspects of the image processing could occur in the image signal
source, the computer, or the destination output device.
The image processing system of Figure 1 is further detailed in Figure 3A which
includes an image signal source 350 connected to an image acquisition device 355 which,
in turn, is connected to RAM 356 and control sequencer logic 360. The RAM 356 is also
WO 96/36941 PCT/CS96/01418
connected to a matrix multiplier 358 and die control sequencer logic 360. The control
sequencer logic 360 and the matrix multiplier 358 are connected to one another and are
both connected to ROM 352, ROM 354, ROM 357 and RAM 36* The RAM 364 and the
control sequencer logic 360 are both connected to an image generator 365 which
5 represents any type of device capable of outputting an image, such as a printer, a CRT
display, etc. The control sequencer logic 360 receives a clock pulsetrain 361 from system
clock 362.
Operation of the image processing system of Figure 3 A, useful for image shearing
and rotation, will be more easily understood in view of die following section on DCT
10 mathematics.
This section sets forth certain fundamental concepts relating to forward and inverse
1 5 discrete cosine transforms.
An image is typically made up of a two-dimensional PxQ array of descriptors
called pixels or image data points, where P is the number of rows and Q is the number of
columns repr e sen ting the image. The image can be represented by either image data
points in the spatial domain, or by corresponding DCT coefficients in the frequency
20 domain. A forward DCT generates the DCT coefficients by taking a discrete even cosine
transformation (DECT abbreviated as DCT) of the image data points. Conversely, an .
inverse discrete even cosine transformation (IDECT abbreviated as IDCT) generates the
IDCT coefficients (i.e. reconstructed image data points) by taking an inverse discrete
cosine transformation of the DCT coefficients.
25 A DCT transformation can occur in any number of dimensions as understood by
those skilled in the art In the following one-dimensional example, a row (more
genetically referred to as a segment) of N image data points sQ) can be transformed
from the spatial domain to corresponding DCT coefficients S(v) in the frequency domain
in accordance with equation (1).
WO 96/3*941
PCT/US96/01418
S(v)»C v ||g 80)co8^22L (D
where: 0 & v £ (N -1), v an integer;
s(j) represents the matrix of image data points in the segment;
S(v) represents the corresponding matrix of DCT coefficients;
N represents the number of image data points in the segment;
C v - -L for v = 0; and
C v - 1 for v # 0.
The DCT coefficients S(v) are determined from equation (1) where the normalized cosine
basis terms are derived for a segment having N image data points. The value for S(0) is
determined for v =» 0 by summing each of the image data points sQ) for 0£j£(N-l)
times the cosine terms of the basis function. The value for S(l) is determined as the
summation of image data points sQ) times the cosine terms for v - 1 . This procedure,
which indexes first on v and then on j, is repeated for derivation of DCT coefficients
S(0) through S(N-1).
A modified inverse discrete cosine transformation is mathematically defined in
equation (2) where the one-dimensional matrix S(v) of DCT coefficients is transformed to
a reconstructed matrix l(y) of reconstructed image data points, and y is defined as a real
number within the given range as disclosed and explained in the parent application 795.
1 N r-0 2N
where: 0 i y S (N -1), y a real number,
S(v) represents the matrix of DCT coefficients;
2<y) represents the spatial matrix of reconstructed image data points;
N represents the number of image data points in the segment;
WO 96/36941
rcvvswoiAii
C v - — for v - 0; and
C ¥ - 1 for v * 0.
If the DCT coefficients S(v) of equation (1) are computed from a set of image data points
sQ) and the reconstructed image data points §(y) of equation (2) are computed from the
corresponding DCT coefficients S(v), then sO) ■ My) when y-j, and the process is
referred to as invertible or one-to-one mapping since the reconstructed image data points
of §(y) are identical, within limits, to the original image data points of s(j>. By evaluating
y in equation (2) at other (non-integer) values where 0 £ y £ (N -1), a modified IDCT is
obtained which can be used for various processes such as the interpolation of values
falling between discrete image data points which represent an image.
In determining the values representing the reconstructed image data points S(y)
using equation (2), 3(0) is determined by summing each of the DCT coefficients S(v)
times the cosine terms of the inverse basis function for y - 0. For example, the value for
3(0.5) is determined as the summation of DCT coefficients S(v) times the cosine terms for
y - 0.5. This procedure, which indexes first on y then en v, is repeated for derivation
of all desired reconstructed image data points §(y) where 0 i y £ (N -1).
As earlier noted, the above mathematics can be readily expanded to multiple
dimensions as known by one of ordinary skill in the art. For instance, an image can be
represented in the spatial domain in two-dimensional format as described in parent
application '795, where s(y,x) represents the image data points at real values y and x
in the spatial domain, S(v,u) represents the corresponding DCT coefficients in the
frequency domain, x ranges from 0 to (P-l), y ranges from 0 to (Q-l), P represents the
total number of rows, and Q represents the total number of columns. The image data
points s(ypc) may represent, but are not limited to, parameters such as brightness,
luminance, color or hue.
Both equations (1) and (2) can alternatively be expressed in matrix notation. The
matrix notation (without indices) for equation (1) is:
S - FB s (3)
W096/36M1 PCTAJSJW01418
where S represents the matrix of DCT coefficients, s represents the matrix of image
data points in the spatial domain, and FB represents the forward DCT basis matrix. The
matrix notation for equation (2) is:
3«IBS (4)
where S represents the spatial matrix of reconstructed image data points, and IB
represents the inverse DCT basis matrix for the desired output points (i.e. reconstructed
image data points). Combining matrix equations (3) and (4) will reduce the number of
arithmetic operations as opposed to performing the matrix algebra in two different steps as
previously described. Combining matrix equations (3) and (4) yields:
3 » IB ; (FB • s)
- MB • s (5)
where MB is a combined DCT basis matrix derived from matrix multiplication of the
inverse DCT basis matrix IB times the forward DCT basis matrix FB. The combined
DCT basis matrix MB can be contemporaneously calculated while solving equation (5),
or MB can be precalculated and stored in a look-up table.
2. Image Rrrtarinn Hardware
The preferred embodiment of an image rotation system shown in Figure 3A
includes: image signal source 350; image acquisition device 355; matrix multiplier 358;
random access memory (RAM) image buffers 356 and 364; coefficient read only memory
(ROM) 352, 354 and 357; control sequencer logic 360; master clock 362; and image
generator 365. The master clock 362 produces a master clock signal 361 which is used by
the control sequencer logic 360 to generate clock signals CK1 and CK2. The image
acquisition device 355 could be any hardware for acquiring an image from the source 350,
such as an input port, an A/D converter, etc. Similarly, the image generator 365 could be
any device or system for generating an image from the coefficients stored in RAMs 356 or
364, such as a printer, cathode ray tube, etc. This overall hardware configuration is
general purpose for implementing a variety of matrix product operations.
Referring to Figures 3A, 3B and 3C, the matrix multiplier logic 351 is a fixed
point parallel arithmetic processor capable of computing a 3x3 matrix product in nine CK1
clock cycles. The control sequencer logic 360 generates clock pulses CK1 and CK2 from
the master clock 362. The buffers 356 and 364 are random access memories to buffer the
input and output images; the read only memories 352, 354 and. 357 store precomputed
coefficient matrices; and the control sequencer logic 360 is used to handle control signals,
timing signals, and memory address signals.
5 The matrix multiplier logic 358 is a three fixed-point multiplier accumulator
(MAC) array shown in detail in Figure 3B, with input/output latches and two bi-directional
data buses 372 and 374. The buses 372 and 374 are configurable to transmit data directly
between RAM 356 and RAM 364 in pass through mode, or to transmit dam to the matrix
multiplier logic 358 for processing in process mode according to Truth Table I which
10 defines the functions of data buses 372 and 374 as controlled by signals I„ and I,.
TRUTH TABLE I
lo
I,
372
374
Mode
0
0
IN
OUT
Pass thru
1
0
IN
OUT
Process
0
1
OUT
IN
Process
1
OUT
IN
Pass thru
WO 94/36941 PCT/US9C/01418
The three MAC units include multipliers 400, 402, and 404 followed by the adder
and accumulator pairs {414, 420}, {416, 422}, and {418, 424}, respectively. The outputs
of the accumulators 420, 422 and 424 are stored, respectively, in output latches 426, 428,
and 430. These provide temporary storage to multiplex the results onto the common
output bus 431.
The control sequencer logic 360 controls the memories and data buses as well as
generating appropriate timing signals for the matrix multiplier logic 358. Specifically, the
control sequencer logic 360 provides to RAM memories 356 and 364, respectively,
address data on lines 378 and 382, and read/write (R/W) control data on lines 376 and
380. The control sequencer logic 360 also provides the matrix multiplier logic 358 with
clock signals CK1 and CK2 (derived from master clock signal 361), bus directional signals
1«, I,; output multiplex control signals 0* 0,. and addresses on line 366 for ROMs 352,
354 and 357. The control sequencer logic 360 is easily implemented with a
microcontroller or programmable logic array (PLA), the choice being application
dependent The former is generally more flexible from a programming standpoint but
somewhat higher in cost than the latter.
The operation of the matrix multiplier logic 358 is easily understood by
considering an example of a 3x3 matrix multiplication where C represents a coefficient
matrix, D represents a source image data matrix, and B represents the result of matrix
multiplying C times D. Thus, for
B U B 13 B t3
K D» Du
Bj, B,, B„
X
D 2l D a D„
, D 3i D„ D M
consider the first column of B
column of D,
which is the sums of products of the rows of C and the first
WO 96/36941
PCI7CS96/01418
Cu D n
+ c iPn
ft*
(7)
The timing diagram in Figure 3C shows the relationship of the control and data
signals for this example. The computation proceeds sequentially with the evaluation of
the first, second, and third columns of the B matrix. The process begins with the clearing
5 of the accumulators by the negative RESET pulse received by the matrix multiplier logic
358 from the control sequencer logic 360. The first column of matrix C, that is, C,„ C a „
Cj„ and the first element of the first column of matrix D, that is D,„ are transferred to
input latches 406, 408, 410 and 412 respectively at time T, of clock pulse CKl.
Specifically, C„ is received from ROM 352 by input latch 406, C, 2 is received from ROM
10 354 by input latch 408, C u is received from ROM 357 by input latch 410, and D„ is
received through the bus transmitter 432 from RAM 356 which stores the source image.
The control signals I<, and I, control both the transfer, and the direction of transfer of data
between the matrix multiplier logic 358 and RAMs 356 and 364 according to Truth Table
I. The logic corresponding to Truth Table I is shown by logic 434 and bus transmitter
15 432. At time T, the products C„D,„ Cj,D,„ and C,,D„ are stored in accumulators 420,
422, and 424, respectively. Logic (not shown) for scaling the outputs, i.e. truncating data,
would typically follow the accumulators, as well known by those skilled in the art, to
handle data overflow conditions. At time T 3 the second column of matrix C, that is, C 12 ,
Cjj, and Cjj, and the second element D 2 , of the first column of D are transferred to the
20 input latches 406, 408, 410 and 412, respectively. The partial sum of products, that is,
C„Di,+C,2Psi, C2,D n +Cj,D 2 „ Cj,D„+CjjD 2 , are stored at time T« in accumulators 420,
422, and 424 respectively. Of course, multiplication occurs in multipliers 400, 402, 404
and addition occurs in adders 414, 416, 418. The process repeats for the third column of
C and third element of the first column of D resulting at T 4 in the first column of B
25 having elements {CuDh+C^+C.jDj,}, {CajDn+CjAj+CaDj,} and
{C^Dh+CjjD^+CjjDj,} which were obtained as the sum of the products of the rows C
and the first column of D (see equation (7)).
At the rising edge of clock pulse CK2 at time T, the data from accumulators 420,
422, and 424 is transferred to the output latches 426, 428, and 430, respectively. Output
13.
W096/36W1 PCT/USW/M418
multiplex control signals O, and O, time multiplex the outputs of the output latches onto
data bus 372 or 374 in accordance with Truth Table I at times T„ T„ and T 10 . The whole
process is repeated in this fashion for computing the remaining second and third columns
of B in equation (6).
5 The above implementation of the hardware of Figures 3A and 3B describes one
complete shear of the image data points representing the image. Of course, to properly
carry out image rotation as described in the following section on miage Rotation
Methodology, three shears are required. Thus, the data points stored in RAM 364 after
one shearing operation should be transferred fa pass-through mode (see Truth Table f)
10 from RAM 364 through matrix multiplier 358 to RAM 356. The shearing process repeats
until all three shears have been accomplished and the resultant image data points stored in
RAM 364 represent the rotated image.
In sum, the matrix multiplier logic 358 of Figure 3A is ideally suited to perform
image rotation. The first and second rows of both the horizontal and vertical transform
15 matrices are stored in ROMs 352 and 354, respectively; the source image is stored in
RAM 356; and the rotated image is stored in RAM 364. Note that the matrix multiplier
logic 358 is clearly able to handle products of matrices other than order three. For
example, vector products, as well as 2x2 matrix products can be easily processed.
Of course functional equivalents of the above described components would work
20 equally well in implementing the image rotation device. For instance, the memories are
not limited to RAMs or ROMs but include any type of known memory devices. Also as
earlier noted, the coefficients stored in ROMs 352, 354 and 357 could be precalculated as
a combined reconstruction vector to be multiplied times the source pixels received from
RAM 356 rather than as precalculated modified EDCT basis coefficients to be multiplied
25 times precalculated DCT coefficients stored in RAM 356.
t.
GO005278S
wo9«/3694i wcmmmua
3. Image Rotation Methodology
The image processing system of Figure 3 A provides image rotation in the DCT
domain by decomposing the rotation operation into a sequence of shearing operations. A
rotation matrix can be represented in the form:
tcosd -smOj
smO cos6 )
5
where 9 is the desired angle of rotation. This rotation matrix can be decomposed into a
product of shearing matrices where
'".63
represents a vertical shearing matrix for shearing an image vertically and
10
represents a horizontal shearing matrix for shearing an image horizontally. The rotation
matrix can be decomposed as either
[cose -sineWl a ) (l o\ (l a] (6)
Unfi cosej Ip 1 J lp i) \0 lj
15
where die vertical shearing factor is defined as B - sin 6 and die horizontal shearing
factor is denned as a = - tan (9/2), or,
■H -^l - f 1 °] f 1 A f 1 °] co
isme cos8 J \p 1 J \0 lj \p lj
20
where the vertical shearing factor is defined as P =- tan ( 9/2) and the horizontal
shearing factor is defined as a --sin 6.
In the case of the decomposition in equation (6), an input image is rotated by an
angle 9 by (1) horizontally shearing the input image to produce a first skewed image (2)
25 vertically shearing the first skewed image to produce a second skewed image, and (3)
horizontally shearing the second skewed image to produce the rotated image.
WO 96/3*941 PCI7US96/0i41§
In the case of the decomposition in equation (7), an input image is rotated by an
angle 9 by (1) vertically shearing the input image to produce a first skewed image (2)
horizontally shearing die first skewed image to produce a second skewed image, and (3)
vertically shearing the second skewed image to produce the rotated image.
5 In the general case of image rotation using the decomposition in either equation (6)
or (7), the image rotation method includes the steps of:
(!) providing a first spatial matrix of pixels representing an original image in two
dimensions;
(ii) generating a first shear matrix (also referred to as a skew matrix)
10 corresponding to a first skewed image by shearing the first spatial matrix in a first
dimension;
(Hi) generating a second shear matrix corresponding to a second skewed image
by shearing the first shear matrix in a second dimension; and
(iv) generating a third shear matrix corresponding to a third skewed image, i.e.
15 the image rotated by 6, by shearing the second shear matrix in the first dimension.
A major advantage behind performing the shearing in the above manner is the decoupling
of the horizontal and vertical shears, so that a new sampling grid for each shear step need
only be computed in one dimension. Of course, it is not always necessary or desirable to
shear all pixels of an image. In some instance, shearing a portion of the image is
20 sufficient
An image, represented as s(j,i), can be horizontally sheared as shown in Figure 4A
according to the decomposition in equation (6), where 0 is initialized to a desired angle of
rotation, and the row index r is initialized to zero in block 440. P represents the total
number of rows in the image, and Q represents the total number of columns in the
25 image. Since the image is represented as a PxQ matrix, Q also represents the total
number of image data points in each row. Thus, if a segment size equal to the total
number of pixels in a row is selected, then Q columns in Figure 4A equates with N
image data points in equation (1). In block 442, the horizontal shearing factor a - -
tan(9/2) is calculated and in block 444 the horizontal offset AH is determined as the
30 product of the horizontal shearing factor a times the row index r (i.e. AH - ctr).
In block 446 all pixels in row r are shifted by the same horizontal offset AH.
This is done using the process shown in either Figure 4B or 4C. Figure 4B shows the
i
000052785
WO 96/36941
PCT/US96J01418
appropriate steps for
and
a complete row of image data points in one
step whereas Figure 4C shows the appropriate steps for shifting and transforming a row of
image data points by first dividing the row into N point segments, where N is equal to
or less man the number of pixels in a row. Specifically, the DCT transformation process
5 of Figure 4C is performed by taking DCT transformations of adjacent three-point segments
(i.e. N-3) of image data points having a one-point overlap, whereas the DCT
transformation process of Figure 4B is performed by taking DCT transformations for N-Q.
For either Figure 4B or 4C, each pixel of a given row is shifted by the value of the
horizontal offset AH, which is generally not an integer and can be written as:
10 AH - I ah + 4h
where I w and f^ are the integer and fractional parts of AH, respectively. The shift by
AH can be implemented by computing the interpolated values of the IDCT coefficients
5(y) as though the shift was by and then just offsetting the result by 1^ in the output
matrix. The modified IDCT of equation (2) is then evaluated according to equation (8) to
15 compensate for the fractional part of the horizontal offset, f m .
20
25
where:
0 < G-f*d * (N-l), j an integer,
0 < y £ (N-l), y a real number,
S(v) represents the matrix of DCT coefficients;
§(y) represents the spatial matrix of reconstructed (skewed) image data
points;
§ 0-1) represents §<y) with the y index changed to integer values;
N represents the number of DCT coefficients in the segment;
WO 9606941
C„ - -L for v - 0; and
C v -1 for v + 0.
For the above values of y, there will be (N-l) reconstructed image data points
S whether N represents either the total number of pixels in the row (Fig. 4B) or a lesser
number of pixels of a segment (Fig. 4C). All of the reconstructed image data points will
lie in the spatial range between 3(0) and S(N-1).
In block 448, the row index r is incremented by one, then tested against the total
number of rows P in block 450. If r < P. then the sequence of steps in blocks 444-450
10 is repeated. In effect, each pass through blocks 444-450 skews another row of pixels of
the image. Completion of the shearing of each row in the image results in a first
horizontally skewed image.
The preferred implementation of the above skewing method is achieved by
segmenting the desired row of the image into smaller, overlapping blocks of image data
15 points of size N as shown by the steps of Fig. 4C. It has been empirically determined
that a three or four point DCT with a one-point overlap between adjacent segments
provides excellent results (although any segment size N could be selected). When N-Z,
the speed of computation is high but the accuracy of the results suffers. In contrast when
N-8, the accuracy of the interpolation is high but computational speed is slow. Thus, a
20 preferred embodiment provides a row of image data points subjected to a three-point DCT
according to equation (1) when N-3. Thus, N is set to 3 and the column index c is
initialized to zero in block 458. Row r equates to index i and column c equates to
index j in DCT equations (1) and (8) for an image represented spatially as sQ,i). In
block 460, a DCT according to equation (1) is taken of the first three-point segment of
25 row r, namely s(0), s(l) and s(2). The modified IDCT of equation (8) is used in block
462 to reconstruct spatial image data points 3(0) and 3(1). The reconstructed image data
point 8(2), which is overlapped with a second three-point segment, will be calculated and
included in the second segment of reconstructed image data points. The values of 3(0)
and 3(1) are stored or otherwise saved as reconstructed image data points of the
30 horizontally skewed image. In block 464 the column index c is incremented to c -
IT
I
300052785
PCT/US96/01418
WO 96/36941 PCT/USM/M418
c+N-1. In block 466, it is determined whether or not all of the pixels have been processed
for row r. If all of the pixels have been processed, then the decision block is answered in
the affirmative and the overall horizontal shearing process continues in block 448. If on
the other hand, all of the pixels in row r have not been processed, then the DCT
5 transformation of the next N-point segment continues in block 460. In the current
example, c - 2, and the next three-point DCT occurs on pixels in the same row spatially
located at s(2), s(3) and s(4). The modified IDCT of equation (8) is used to reconstruct
spatial image data points 8(2) and §(3) in block 462. The values of S(2) and 3(3) are saved
for the matrix of reconstructed image data points. The above overlapping procedure
10 continues for each pixel of the image resulting in a first horizontally skewed image
represented as a matrix of reconstructed image data points.
Once the image is horizontally skewed to form the first horizontally skewed image
according to decomposition equation (6), then the second step of equation (6) calls for
vertically shearing the first horizontally skewed image. This can be accomplished by
15 following the steps of Figure 5 A.
In block 500, 6 maintains the same value selected for horizontal skewing, and the
column index c is initialized to zero. Since the image is represented as a PxQ matrix, P
also represents the total number of pixels in each column. In block 502, the vertical
shearing factor (J - sinO is calculated and in block 504 the vertical offset AV is
20 determined as the product of the vertical shearing factor p times the column index c (i.e.
AV-pc).
In block 506 all pixels in column c are shifted by the same vertical offset AV.
This is done using the process shown in either Figure 5B or 5C. Figure 5B shows the
appropriate steps for shifting and transforming a complete column of image data points in
25 one step whereas Figure 5C shows the appropriate steps for shifting and transforming a
column of image data points by first dividing the column into N point segments, where
N is equal to or less than the number of pixels in a column. Specifically, the DCT
transformation process of Figure 5C is performed for the present embodiment by taking
DCT transformations of adjacent three-point segments (Le. N-3) of image data points
30 having a one-point overlap, whereas the DCT transformation process of Figure 5B is
performed by taking DCT transformations for N-P.
I*
W09W36941 PCTAJS9<y01418
For either Figure SB or 5C, each pixel of a given column is shifted by the value of
the vertical offset AV, which is generally not an integer and can be written as:
AV-I 4V + f AV
where I 4V and f 4V are the integer and fractional parts of AV, respectively. The shift by
AV can be implemented by computing the interpolated values of the IDCT coefficients
§(y) as though the shift was by f 4V and then just offsetting the result by I AV in the output
matrix. The modified IDCT of equation (2) is then evaluated according to equation (9) to
b for the fractional part of the vertical offset, f AV .
» 0-D » W U*,- J| g c, sm «=''.«-v» <»>
15 points:
where: 0 < (j-f 4 v) * (N-l), j an i
0 < y £ (N-l), y a real i
S(v) represents the matrix of DCT coefficients;
S(y) represents the spatial matrix of reconstructed (skewed) image data
§ 0-1) represents S(y) with the y index changed to integer values;
N represents the number of DCT coefficients in the segment;
C v - -L forv«6;and
ft
C ¥ «lforv#0.
For the above values of y, there will be (N-l) reconstructed image data points
whether N represents either the total number of pixels in the column (Fig. 5B) or a lesser
number of pixels of a segment (Fig. 5C). All of the reconstructed image data points will
lie in the spatial range between 3(0) and S(N-1).
In block 508, the column index c is incremented by one, then tested against the
total number of columns Q in block 510. If c < Q, then the sequence of steps in blocks
504-510 is repeated. In effect, each pass through blocks 504-510 skews another column
000032783
W096/365M1 PCT/US94/01418
of pixels of the image. Completion of the shearing of each column in the first
horizontally skewed image results in a first vertically skewed image.
As described above, the current example uses a three-point DCT with a one-point
overlap between adjacent segments. Specifically, a column of image data points is
subjected to a three-point DCT according to equation (1) when N-3. Thus, N is set to 3
and the row index r is initialized to zero in block 518. Row r equates to index i and
column c equates to index j in DCT equations (1) and (8) for an image represented
spatially as $0,0- hi block 520, a DCT according to equation (I) is taken of the first
three-point segment of column c, namely s(0), s(l) and s(2). The modified IDCT of
equation (8) is used in block 522 to reconstruct spatial image data points 1(0) and §(1).
The reconstructed image data point S(2), which is overlapped with a second three-point
segment, will be calculated and included in the second segment of reconstructed image
data points. The values of 3(0) and 8(1) are stored or otherwise saved as reconstructed
image data points of the vertically skewed image. In block 524 the row index r is
incremented to r - r+N-1. In block 526, it is determined whether or not all of the pixels
have been processed for column c. If all of the pixels have been processed, then the
decision block 526 is answered in the affirmative and the overall horizontal shearing
process continues in block 508. If on the other hand, all of the pixels in column c have
not been processed, then the DCT transformation of the next N-point segment continues
in block 520. In the current example, the value of r in block 526 is 2 after processing
the first three-point segment, and the next three-point DCT occurs on pixels in the same
column spatially located at s(2), s(3) and s(4). The modified IDCT of equation (8) is used
to reconstruct spatial image data points 8(2) and S(3) in block 522. The values of 9(2) and
3(3) are saved for the matrix of reconstructed image data points. The above overlapping
procedure continues for each pixel of the- image resulting in a first vertically skewed
image represented as a matrix of reconstructed image data points.
Once the first vertically skewed image is obtained by vertically skewing the first
horizontally skewed image, then the final step for rotating the image by an angle 0,
according to decomposition of the rotation matrix as shown in equation (6), can be
implemented by horizontally skewing the first vertically skewed matrix in an effort to
obtain a second horizontally skewed matrix. This is done in the same manner as described
above in accordance with the process shown in Figures 4 A, 4B and 4C for obtaining the
WO 96/36941 PCMJS96/01418
first horizontally skewed matrix. For the preferred embodiment using the three-point DCT
with a one-point overlap, the processing steps of Figure SC are followed. The second
horizontally sheared image (represented by the second horizontally skewed matrix), which
is obtained by horizontally skewing the first vertically skewed image, corresponds to the
original image rotated by the selected angle 6.
The same procedures described above can be applied when rotating the image by
decomposition of the rotation matrix of equation (7). However, the value of the horizontal
shearing factor of block 442 in Figure 4A will be changed to a - -sin6, and the value of
the vertical shearing factor of block 502 in Figure 5A will be changed to p - tan(0/2).
It is to be understood that the above described embodiments are merely illustrative
of the present invention and represent a limited number of the possible specific
embodiments that can provide applications of the principles of the invention. Numerous
and varied other arrangements, which are structurally or operationally equivalent to the
claims as described herein, may be readily devised in accordance with these principles by
those skilled in the art without departing from the spirit and scope of the invention as
claimed.
. Claims:
1. An image processing system for skewing or rotating an image received
from an image signal source and represented as an image signal, said system comprising:
means for acquiring the image signal from the image signal source;
a first memory for buffering said acquired image signal;
a second memory for storing predetermined discrete cosine transform coefficients;
a matrix multiplier for multiplying said image signal retrieved from said first
memory times said predetermined coefficients to produce an output signal;
a third memory for buffering said output signal;
control sequencer logic for controlling operation of said image processing system;
and
an output device for retrieving said output signal from said third memory and
providing a skewed or rotated image corresponding to said output signal.
2. An image processing device for skewing a segment of an image, said image
represented by pixels aligned in rows and columns in a spatial domain, said device
comprising:
means for selecting an image rotation angle;
means for initializing one of a column index and a row index;
means for determining a shear factor trigonometrically related to said image
rotation angle;
means for determining an offset related to said shear factor and said one of the
column index and the row index;
means for generating DCT coefficients of the segment of pixels related to said one
of die column index and the row index by taking a forward discrete even cosine
transformation (DCT) of said segment; and
means for generating modified IDCT coefficients of said segment by taking a
modified inverse discrete even cosine transformation (IDCT) of said DCT coefficients,
said modified IDCT coefficients corresponding to said skewed segment of the image.
WO 96/3694 1 PCT/US9«m418
3. The image processing method of claim 2, further comprising means for
generating said skewed segment of the image in response to said modified IDCT
4. An image processing device for skewing an image represented by pixels
aligned in rows and columns in a spatial domain, said method comprising:
means for selecting an image rotation angle 9;
means for initializing an integer row index r,
means for determining a horizontal shear factor a - -tan(6/2); .
means for determining a horizontal offset AH defined as the product of the
horizontal shear factor a times the row index r,
means for generating DCT coefficients of segments of pixels related to said row
index r by taking a forward discrete even cosine transformation (DCT) of said segments;
means for generating modified IDCT coefficients corresponding to shifted pixels by
taking a modified inverse discrete even cosine transformation (IDCT) of said DCT
coefficients of said segments using a modified IDCT basis matrix dependent upon said
horizontal offset; and
means for incrementing said row index r by one so that a next row of said image
can be processed.
5. The image processing device of claim 4, further comprising means for
generating said skewed image in response to said modified IDCT coefficients.
6. An image processing device for skewing an image represented by pixels
aligned in rows and columns in a spatial domain, said device comprising:
means for selecting a predetermined image rotation angle 8;
means for initializing an integer column index c;
means for determining a vertical shear factor P - sin 6;
means for determining a vertical offset A V defined as the product of the vertical
shear factor 0 times the column index c;
means for generating DCT coefficients of segments of pixels related to said column
index c by taking a forward discrete even cosine transformation (DCT) of said segments;
i
300052785
means for generating modified IDCT coefficients corresponding to shifted pixels by
taking a modified inverse discrete cosine transformation (IDCT) of said DCT coefficients
of said segments using a modified IDCT basis function dependent upon said vertical
offset; and
means for incrementing said column index c by one so that a next column of said
image can be processed.
7. The image processing device of claim 6, further comprising means for
generating said skewed image in response to said modified IDCT coefficients.
8. An image processing device which provides an output image rotated
relative to an input image by an angle 8, said device comprising:
(a) means for providing an array of rows and columns of pixels representing the
input image;
(b) means for producing a first sheared image by performing a first shearing
operation on said array, said means for producing a first sheared image comprising:
(bl) means for initializing an integer row index r,
(b2) means for determining a horizontal shear factor a - -tan(6/2);
(b3) means for determining a horizontal offset AH defined as the product of
the horizontal shear factor a times the row index r,
(b4) means for generating DCT coefficients, of a segment of pixels related
to said row index r, by taking a forward discrete even cosine transformation
(DCT) of said segment;
(b3) means for generating modified IDCT coefficients, corresponding to
shifted pixels, by performing a modified inverse discrete even cosine
transformation (IDCT) of said DCT coefficients of said row index r using a
modified IDCT basis function dependent upon said horizontal offset;
(b6) means for incrementing said row index r by one so that a next row of
said image can be processed;
(c) means for producing a second sheared image by performing a second shearing
operation on said first sheared image, said means for producing a second sheared image
comprising:
a4
W096M6941 PCT/US96/01418
(cl) means for initializing an integer column index c;
(c2) means for determining a vertical shear factor P - sin 9;
(c3) means for determining a vertical offset AV defined as the product of
the vertical shear factor 3 times the column index c;
(c4) means for generating DCT coefficients of a segment of pixels related
to said column index c by taking a forward discrete even cosine transformation
(DCT) of said segment;
(c5) means for generating modified IDCT coefficients, corresponding to
shifted pixels, by performing a modified inverse discrete cosine transformation
(IDCT) of said DCT coefficients of said column index c using a modified IDCT
basis function dependent upon said vertical offset;
(c6) means for incrementing said column index c by one so that a next
column of said image can be processed; and
(d) means for producing a third sheared image corresponding to said output image
by performing said first shearing operation on said second sheared image.
9. An image processing device which provides an output image rotated
relative to an input image by an angle 8, said device comprising:
(a) means for providing an array of rows and columns of pixels representing the
input image;
(b) means for producing a fust sheared image by performing a fust shearing
operation on said array, said means for producing a first sheared image comprising:
(bl) means for initializing an integer column index c;
<b2) means for determining a vertical shear factor 0 - tan (6/2);
(b3) means for determining a horizontal offset AH defined as the product of
the vertical shear factor 0 times the column index c;
(b4) means for generating DCT coefficients of a segment of pixels related
to said column index c by taking a forward discrete even cosine transformation
(DCT) of said segment;
(b5) means for generating modified IDCT coefficients, corresponding to
shifted pixels, by performing a modified inverse discrete even cosine
"WO 96/36941 PCI/USJHtfOMM
transformation (IDCT) 0 f said DCT coefficients of said column index c using a
modified IDCT basis function dependent upon said vertical offset;
(c6) means for incrementing said column index c by one so mat a next
column of said image can be processed; and
(c) means for producing a second sheared image by performing a second shearing
operation on said first sheared image, said means for producing a second sheared image
comprising:
(cl) means for initializing an integer row index r;
(c2) means for determining a horizontal shear factor a » -sin 6;
(c3) means for determining a horizontal offset AH defined as the product of
the horizontal shear factor a times the row index r,
(c4) means for generating DCT coefficients of a segment of pixels related
to said row index r by taking a forward discrete even cosine transformation
(DCT) of said segment;
(c5) means for generating modified IDCT coefficients, corresponding to
shifted pixels, by performing a modified inverse discrete cosine transformation
(IDCT) of said DCT coefficients of said row index r using a modified IDCT
basis function dependent upon said vertical offset;
(c6) means for incrementing said row index r by one so that a next row of
said image can be processed; and
(d) means for producing a third sheared image corresponding to said output image
by performing said first shearing operation on said second sheared image.
10. An image processing method for skewing a segment of an image, said
image represented by pixels aligned in rows and columns in a spatial domain, said method
comprising the steps of:
selecting an image rotation angle;
initializing one of a column index and a row index;
determining a shear factor trigonometrical ly related to said image rotation angle;
determining an offset related to said shear factor and said one of the column index
and the row index;
WO 96/36941 PCT/US96/01418
generating DCT coefficients of the segment of pixels related to said one of the
column index and the row index by taking a forward discrete even cosine transformation
(DCT) of said segment; and
generating modified IDCT coefficients of said segment by taking a modified
inverse discrete even cosine transformation (IDCT) of said DCT coefficients, said
modified IDCT coefficients corresponding to said skewed segment of the image.
11. The image processing method of claim 10, further comprising the step of
generating said skewed segment of the image in response to said modified IDCT
coefficients.
12. An image processing method for skewing an image represented by pixels
aligned in rows and columns in a spatial domain, said method comprising the steps of:
(12a) in a microcomputer, selecting an image rotation angle 6;
(12b) in said microcomputer, initializing an integer row index r,
(12c) in said microcomputer, determining a horizontal shear factor be - -tan(0/2);
(12d) in said microcomputer, determining a horizontal offset AH defined as the
product of the horizontal shear factor a times the row index r,
(12e) in an array processor, generating DCT coefficients of a segment of pixels
related to said row index r by taking a forward discrete even cosine transformation
(DCT) of said segment;
(12f) in said array processor, generating modified IDCT coefficients corresponding
to shifted pixels by taking a modified inverse discrete even cosine transformation (IDCT)
of said DCT coefficients of said segment using a modified IDCT basis matrix dependent
upon said horizontal offset;
(12g) repeating steps (12e) and (12f) until said modified IDCT coefficients are
generated for every pixel corresponding to said row index r, and
(12h) incrementing said row index r by one, then repeating steps (12d) through
(12g) until said modified IDCT coefficients are generated for every pixel of the image.
13. The image processing method of claim 12, further comprising the step of
generating said skewed image in response to said modified IDCT coefficients:
WOJK/WMl PCT/US96/M418
14. An image processing method for skewing an image represented by pixels
aligned in rows and columns in a spatial domain, said method comprising the steps of:
(14a) in a microcomputer, selecting a predetermined image rotation angle 8;
(14b) in said microcomputer, initializing an integer column index c;
(14c) in said microcomputer, determining a vertical shear factor P » sin 8;
(14d) in said microcomputer, determining a vertical offset AV defined as the
product of the vertical shear factor P times the column index c;
(He) in an array processor, generating DCT coefficients of a segment of pixels
related to said column index c by taking a forward discrete even cosine transformation
(DCT) of said segment;
<14f) in said array processor, generating modified IDCT coefficients corresponding
to shifted pixels by taking a modified inverse discrete cosine transformation (IDCT) of
said DCT coefficients of said segment using a modified IDCT basis function dependent
upon said vertical offset;
(14g) repeating steps (14c) and (I4f) until said modified IDCT coefficients are
generated for every pixel corresponding to column index c; and
(14h) incrementing said column index c by one, then repeating steps (14d)
through (14g) until said modified IDCT coefficients are generated for every pixel of the
image.
15. The image processing method of claim 14, further comprising the step of
generating said skewed image in response to said modified IDCT coefficients.
16. An image processing method which provides an output image rotated
relative to an input image by an angle 8, said method comprising the steps of: .
(a) providing an array of rows and columns of pixels representing the input image;
(b) producing a first sheared image by performing a first shearing operation on said
array, said first shearing operation comprising the substeps of:
(bl) initializing an integer row index r,
(b2) determining a horizontal shear factor a - -tan(8/2);
(b3) determining a horizontal offset AH defined as the product of the
horizontal shear factor a times the row index r,
WO 96/36941 PCT/US96/014M
(b4) generating DCT coefficients, of a segment of pixels related to said row
index r, by taking a forward discrete even cosine transformation (DCT) of said
(b5) generating modified IDCT coefficients, corresponding to shifted pixels,
by performing a modified inverse discrete even cosine transformation (IDCT) of
said DCT coefficients of said row index r using a modified IDCT basis function
dependent upon said horizontal offset;
(b6) repeating steps (b4) and (b5) until said first modified IDCT coefficients
are generated for every pixel corresponding to said row index r,
(b7) incrementing said row index r by one, then repeating steps (b3)
through (b6) until said modified IDCT coefficients are generated for every pixel of
the image;
(c) producing a second sheared image by performing a second shearing operation
on said first sheared image, said second shearing operation comprising the sub steps of:
(cl) initializing an integer column index c;
(c2) determining a vertical shear factor B 30 sin 0;
(c3) determining a vertical offset AV defined as the product of the vertical
shear factor B times the column index c;
<c4) generating DCT coefficients of a segment of pixels related to said
column index c by taking a forward discrete even cosine transformation (DCT) of
said segment;
(c5) generating modified IDCT coefficients, corresponding to shifted pixels,
by performing a modified inverse discrete cosine transformation (IDCT) of said
DCT coefficients of said column index c using a modified IDCT basis function
dependent upon said vertical offset;
(c6) repeating steps (c4) and (cS) until said modified IDCT coefficients are
generated for every pixel corresponding to column index c; and
(c7) incrementing said column index c by one, then repeating steps (c2)
through (c6) until said modified IDCT coefficients are generated for every pixel of
the image; and
(d) producing a third sheared image corresponding to said output image by
performing said first shearing operation on said second sheared image.
PCT/OS9«m418
17. An image processing method which provides an output image rotated
relative to an input image by an angle 0, said method comprising the steps of:
(a) providing an array of rows and columns of pixels representing the input image;
(b) producing a first sheared image by performing a first shearing operation on said
array, said first shearing operation comprising the substeps of:
(bl) initializing an integer column index c;
(b2) determining a vertical shear factor p- tan (6/2);
(b3) determining a horizontal offset AH defined as the product of the
vertical shear factor P times the column index c;
(b4) generating DCT coefficients of a segment of pixels related to said
column index c by taking a forward discrete even cosine transformation (DCT) of
said segment;
(b5) generating modified IDCT coefficients, corresponding to shifted pixels,
by performing a modified inverse discrete even cosine transformation (IDCT) of
said DCT coefficients of said column index c using a modified IDCT basis
function dependent upon said vertical offset;
(b6) repeating steps (b4) and (bS) until said first modified IDCT coefficients
are generated for every pixel corresponding to said column index c;
(b7) incrementing said column index c by one, then repeating steps 0>2)
through (b6) until said modified IDCT coefficients are generated for every pixel of
the image;
(c) producing a second sheared image by performing a second shearing operation
on said first sheared image, said second shearing operation comprising die substeps of:
(cl) initializing an integer row index rj
(c2) determining a horizontal shear factor a " -sin 9;
(c3) determining a horizontal offset AH defined as the product of the
horizontal shear factor a times the row index r,
(c4) generating DCT coefficients of a segment of pixels related to said row
index r by taking a forward discrete even cosine transformation (DCT) of said
segment;
WO 96/36941 PCT/USSKW01418
(c5) generating modified IDCT coefficients, corresponding to shifted pixels,
by performing a modified inverse discrete cosine transformation (IDCT) of said
DCT coefficients of said row index r using a modified IDCT basis function
dependent upon said vertical offset;
(c6) shifting said segment of pixels related to said row index r and
repeating steps (c4) and (c5) until said modified IDCT coefficients are generated
for every pixel corresponding to said row index r;
(c7) incrementing said row index r by one, then repeating steps (c2)
through <c6) until said modified IDCT coefficients are generated for every pixel of
the image; and
(d) producing a third sheared image corresponding to said output image by
performing said first shearing operation on said second sheared image.
5)
WO 96/3*941
PCI7CS96/01418
2/11
FIG.2A
(PRIOR ART)
FIG. 2B
(PRIOR ART)
G000S278S
WO 96/36941
PCT/0S96/O1418
PCT/US96/01418
6/11
. SELECT 0
r»o
P rows
Q columns
a=-7
6
AN J
AH
SHIFT ROW r
BY AH
r =
r+1
end y
FIG. 4A
7/11
FROM BLOCK 444
446
TAK
OFF
= DCT
lOWr
TAKE
OF R
Ml DCT
OW r
456
TO BLOCK 443
PCTAJS96/01418
FIG. 4B
PC17US9fi/01418
8/11
FROM BLOCK 444
TO BLOCK 44a
FIG. 4C
WO 96/35941
PCT/US9«/01418
9/11
SELECT 0
C = 0
prows
Q columns
fi »
sin0
AV:
SHIFT COL. C
BY AV
C = C+1
FIG.5A
O00052783
WO 96/36941
PCT/US96701418
10/11
FROM BLOCK 504
TAKE OCT
OF COL C r 514
TAKE MIDCT
OF COL C
TO BLOCK SOS
FIG. 5B
WO 96/36941
PCT/US9«/01418
11/11
FROM BLOCK 504
TO BLOCK 522
FIG. 5C
INTERNATIONAL SEARCH REPORT
PCT/US 96/61418
C DOCUMENTS CONSIDERED TO «8 RELEVANT
W0.A.95 29461 (SARNOFF DAVID RES CENTER) 2
November 1995
see page 1, line 1 - page 3, line 5; claim
CVGIP: GRAPHICAL MODELS AND IMAGE
PROCESSING, JULY 1992, USA,
vol. 54, no. 4. ISSN 1049-9652,
pages 340-344, XP060288359
DAN1ELSS0N P E ET AL: "High-accuracy
rotation of images"
see page 348, right-hand column, paragraph
5 - page 341, left-hand column, paragraph
see page 342, right-hand column, paragraph
2 - paragraph 3
(3 f
[X"j Pttm* tuaiy mrabm mt tiDml fa man.
n of mean* of tt» iau
25. OK 96
Ewopon Punt Omo, P.B. nitPOoMim:
NL • ma hv Rlfenrifc
Td. ( ♦11-701 MO-JMO, T*. 31 651 niK,
Pierfederici, A
page 1 of 2
INTERNATIONAL SEARCH REPORT
PCT/US 96/91418
G(poafaattiea) DOCUMENTS CONSIDERED TP BE RELEVANT
nwttim rf -VrnmrnWi rHti totoatom wiwm »ppmpri«. iVttw rrtmni |Wi|»il
RdcmatMcItfanN*
A
PROCEEDINGS CVPR '86: IEEE COMPUTER
SOCIETY CONFERENCE ON COMPUTER VISION AND
PATTERN RECOGNITION (CAT. N0.86CH2299-5) ,
MIAMI BEACH, Ft, USA, 22-26 JUNE 1986,
ISBN 0-8186-8721-1, 1986, WASHINGTON, DC,
USA, IEEE COMPUT. SOC. PRESS, USA,
pages 272-277, XP60057226a
TANAKA A ET At: "A rotation method for
raster Image using skew transformation*
see the whole document
1-17
A
PROCEEDINGS OF GRAPHICS INTERFACE '86 AND
VISION INTERFACE '86, VANCOUVER, BC,
CANADA, 26-30 MAY 1986, 1986, TORONTO,
ONT. , CANADA, CANADIAN INF. PROCESS. SOC,
CANADA,
pages 77-81. XP90Q572832
PAETH A W: "A fast algorithm for general
raster rotation"
see the whole document
1-17
A
EP.A.9 433 645 (XEROX CORP) 26 June 1991
Fm PCT/ISA/31t (auiMMfM of mmm* HUM) (July tttj)
page 2 of 2
INTERNATIONAL SEARCH REPORT
Pa/US 96/01418
dtad in mroh raport
Paunt ftmfly
m«nbtr(t)
PubttaUion
dm
WO-A-9529461 92-11-95 ' NONE
US-A- 5111192 05-95-92
CA-A- 2027458 21-96-91
JP-A- 3290765 29-12-91